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These are the notes for the Clay Mathematics Institute Senior Scholar 
Lecture which was delivered by Bernd Sturmfels in Park City, Utah, on July 
22, 2004. The topic of this lecture is the "tropical approach" in mathemat- 
ics, which has gotten a lot of attention recently in combinatorics, algebraic 
geometry and related fields. It offers an an elementary introduction to this 
subject, touching upon Arithmetic, Polynomials, Curves, Phylogenetics and 
Linear Spaces. Each section ends with a suggestion for further research. The 
bibliography contains numerous references for further reading in this field. 

The adjective "tropical" was coined by French mathematicians, including 
Jean- Eric Pin ^\ , in the honor of their Brazilian colleague Imre Simon |23] , 
who was one of the pioneers in min-plus algebra. There is no deeper meaning 
in the adjective "tropical" . It simply stands for the French view of Brazil. 

1 Arithmetic 

Our basic object of study is the tropical semiring (M U {oo},©,0). As a 
set this is just the real numbers M, together with an extra element oo which 
represents infinity. However, we redefine the basic arithmetic operations of 
addition and multiplication of real numbers as follows: 

X ® y := min(x,y) and x Q y := x + y. 

In words, the tropical sum of two numbers is their minimum, and the tropical 
product of two numbers is their sum. Here are some examples of how to do 
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arithmetic in this strange number system. The tropical sum of 3 and 7 is 3. 
The tropical product of 3 and 7 equals 10. We write this as follows: 

3 e 7 3 and 3 7 = 10. 

Many of the familar axioms of arithmetic remain valid in tropical mathemat- 
ics. For instance, both addition and multiplication are commutative: 

X ® y — y ® X and x (D y — y (D x. 

The distributive law holds for tropical addition and tropical multiplication: 

X Q {y <S) z) — X Q y ® X Q z. 

Here is a numerical example to show distributivity: 

3 (7 © 11) = 30 7 = 10, 
3 07 © 3 11 = 10 © 14 = 10. 

Both arithmetic operations have a neutral element. Infinity is the neutral 
element for addition and zero is the neutral element for multiplication: 

X ® oo — X and x = 0. 

Elementary school students tend to prefer tropical arithmetic because the 
multiplication table is easier to memorize, and even long division becomes 
easy. Here are the tropical addition table and the tropical multiplication table: 
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But watch out: tropical arithmetic is tricky when it comes to subtraction. 
There is no x which we can call "10 minus 3" because the equation 3®a; = 10 
has no solution x at all. To stay on safe ground, in this lecture, we shall 
content ourselves with using addition © and multiplication only. 
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It is extremely important to remember that "0" is the muhiphcatively 
neutral element. For instance, the tropical Pascal's triangle looks like this: 































































The rows of Pascal's triangle are the coefficients appearing in the Binomial 
Theorem. For instance, the third row in the triangle represents the identity 



Of course, the zero coefficients can be dropped in this identity: 

{x®yf = e x'^y e xy^ y^. 
Moreover, the Freshman's Dream holds for all powers in tropical arithmetic: 

{x®yf = ® y^. 

The validity of the three displayed identities is easily verified by noting that 
the following equations hold in classical arithmetic for all y e M: 

3-min{x, y} = mm{3x,2x + y,x + 2y,3y} — min{3a;,3y}. 

Research problem: The set of convex polyhedra in can be made into 
a semiring by taking as "Minkowski sum" and as "convex hull of the 
union". A natural subalgebra is the set of all polyhedra which have a fixed 
recession cone C. If n = 1 and C = ]R>o then we get the tropical semir- 
ing. Develop linear algebra and algebraic geometry over these semirings, and 
implement efficient software for doing arithmetic with polyhedra when n < 4. 

2 Polynomials 

Let Xi,X2, ■ ■ ■ ,Xnhc variables which represent elements in the tropical semir- 
ing (M U {oo}, 0, 0). A monomial is any product of these variables, where 



{x®y)^ 



{x®y)(D{x®y)Q{x® y) 
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repetition is allowed. (Technical note: we allow negative integer exponents.) 
By commutativity, we can sort the product and write monomials in the usual 
notation, with the variables raised to exponents: 

X2 Q Xi Q Xs Q Xi Q X4 Q X2 Q Xs Q X2 — x\x\x\xi. 

A monomial represents a function from to M. When evaluating this 
function in classical arithmetic, what we get is a linear function: 

X2 + Xi ^- X2, + Xi ^- Xi ^- X2 + X2, + X2 = 2xi + 8X2 + + 3^4- 

Every linear function with integer coefficients arises in this manner. 
Fact 1. Tropical monomials are the linear functions with integer coefficients. 
A tropical polynomial is a finite linear combination of tropical monomials: 

p{xi,...,Xn) = aQx'^x'i ■■■x'^ e hQ x^^x^2 ■ ■ -^t © ••• 

Here the coefficients a,b, . . . are real numbers and the exponents ii,ji,... are 
integers. Every tropical polynomial represents a function — > M. When 
evaluating this function in classical arithmetic, what we get is the minimum 
of a finite collection of linear functions, namely, 

p{xi,...,Xn) = min(a + iia;iH \- inXn , b + jiXi -\ \- jnXn , ■ ■ ■) 

This function p : ^ M has the following three important properties: 

• p is continuous, 

• p is piecewise-linear, where the number of pieces is finite, and 

• pis concave, i.e., p{^^) > l{p{x) + p{y)) for all x,y e R". 

It is known that every function which satisfies these three properties can be 
represented as the minimum of a finite set of linear functions. We conclude: 

Fact 2. The tropical polynomials in n variables precisely the 

piecewise-linear concave functions on M" with integer coefficients. 
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Figure 1: The graph of a cubic polynomial and its roots 



As a first example consider the general cubic polynomial in one variable x, 

p{x) = aQx^®bQx'^®cQx®d. (1) 

To graph this function we draw four lines in the {x, y) plane: y = 3x + a, 
y = 2x + b, y = x + c and the horizontal line y = d. The value of p{x) is the 
smallest y-value such that (x, y) is on one of these four lines, i.e., the graph 
of p{x) is the lower envelope of the lines. All four lines actually contribute if 

b-a<c-b<d-c. (2) 

These three values of x are the breakpoints where p{x) fails to be linear, and 
the cubic has a corresponding factorization into three linear factors: 

p{x) = aQ{x ® {b-a))Q{x ® {c-b))Q{x ® (d-c)). (3) 

See Figure 1 for the graph and the roots of the cubic polynomial p{x). 

Every tropical polynomial function can be written uniquely as a tropical 
product of tropical linear functions (i.e., the Fundamental Theorem of Algebra 
holds tropically) . In this statement we must underline the word "function" . 
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Distinct polynomials can represent the same function. We are not claiming 
that every polynomial factors into linear functions. What we are claiming is 
that every polynomial can be replaced by an equivalent polynomial, repre- 
senting the same function, that can be factored into linear factors, e.g., 

® l7Qx ® 2 = x^©l0x©2 = {x Q) if. 

Unique factorization of polynomials no longer holds in two or more variables. 
Here the situation is more interesting. Understanding it is our next problem. 

Research problem: The factorization of multivariate tropical polynomials 
into irreducible tropical polynomials is not unique. Here is a simple example: 

(0 0X0 0) (O0t/0O) (0 0x0 2/0 0) 
= (O0x0y0O0x0O)0(O02;0?/0O0?/0O). 

Develop an algorithm (with implementation and complexity analysis) for 
computing all the irreducible factorizations of a given tropical polynomial. 
Gao and Lauder ^2] have shown the importance of tropical factorization for 
the problem of factoring multivariate polynomials in the classical sense. 

3 Curves 

A tropical polynomial function p : — M is given as the minimum of 
a finite set of linear functions. We define the hypersurface Ti.{p) to be the 
set of all points x G at which this minimum is attained at least twice. 
Equivalently, a point x G lies in Ti.{p) if and only if p is not linear at x. 
For example, if n = 1 and p is the cubic in with the assumption (j2)), then 

'^(p) = {6 — a, c — 6, d — c}. 

Thus the hypersurface 7i(p) is the set of "roots" of the polynomial p{x). 
In this section we consider the case of a polynomial in two variables: 

Pix.y) = ^CijQx'Qy^. 

Fact 3. The tropical curve 'H{p) is a finite graph which is embedded in the 
plane M^. It has both bounded and unbounded edges, all edge directions are 
rational, and this graph satisfies a zero tension condition around each node. 
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The zero tension condition is the following geometric condition. Consider 
any node (x, y) of the graph and suppose it is the origin, i.e., (x, y) = (0, 0). 
Then the edges adjacent to this node lie on lines with rational slopes. On 
each such ray emanating from the origin consider the first non-zero lattice 
vector. Zero tension at {x, y) means that the sum of these vectors is zero. 

Our first example is a line in the plane. It is defined by a polynomial: 

y) = aQx®hQy®c where a,b,c E M. 

The curve 'H{p) consists of all points {x, y) where the function 

p : — > R , (x, ?/) I— > min (a + x, 6 + ?/, c) 

is not linear. It consists of three half-rays emanating from the point (a;, y) ~ 
{c — a,c — b) into northern, eastern and southwestern direction. 

Here is a general method for drawing a tropical curve Ti.{p) in the plane. 
Consider any term 7 y^ appearing in the polynomial p. We represent 
this term by the point {'~f,i,j) in M^, and we compute the convex hull of 
these points in M"^. Now project the lower envelope of that convex hull into 
the plane under the map — M^, (7,i,i) ^ {hi)- The image is a planar 
convex polygon together with a distinguished subdivision A into smaller 
polygons. The tropical curve Ti-ip) is the dual graph to this subdivision. 

As an example we consider the general quadratic polynomial 

p{x,y) = aQx^ ® hQxy ® cQy"^ ® dQx ® eQy ® f. 

Then A is a subdivision of the triangle with vertices (0, 0), (0, 2) and (2, 0). 
The lattice points (0, 1), (1,0), (1, 1) are allowed to be used as vertices in 
these subdivisions. Assuming that a, b, c,d,e, f eM. are general solutions of 

2b<a + c, 2d<a + f , 2e<c + f, 

the subdivision A consists of four triangles, three interior edges and six 
boundary edges. The curve 7i(p) has four vertices, three bounded edges and 
six half-rays (two northern, two eastern and two southwestern). In Figure 2, 
Ti.{p) is shown in bold, and the subdivision A is shown in thin lines. 

Fact 4. Tropical curves intersect and interpolate like algebraic curves do. 

1. Two general lines meet in one point, a line and a quadric meet in two 
points, two quadrics meet in four points, etc. . . . 
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Figure 2: The subdivision A and the tropical curve 



2. Two general points lie on a unique line, five general points lie on a 
unique quadric, etc... 

For a general discussion of Bezout's Theorem in tropical algebraic geom- 
etry, and for many pictures illustrating Fact 4, we refer to the article 



Research problem: Classify all combinatorial types of tropical curves in 
3 -space of degree d. Such a curve is a finite embedded graph of the form 



C 



n{pi) n H{p2) n ■■■ nn{pr) c 



where the pi are tropical polynomials, C has d unbounded parallel halfrays in 
each of the four coordinate directions, and all other edges of C are bounded. 



4 Phylogenetics 

An important problem in computational biology is to construct a phylogenetic 
tree from distance data involving n taxa. These taxa might be organisms or 
genes, each represented by a DNA sequence. For an introduction to phy- 
logenetics we recommend Jl] and [21]. Here is an example, for n = 4, to 
illustrate how such data might arise. Consider an alignment of four genomes: 
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Human: ACAATGTCATTAGCGAT . . . 

Mouse: ACGTTGTCAATAGAGAT . . . 

Rat: ACGTAGTCATTACACAT . . . 

Chicken: GCACAGTCAGTAGAGCT . . . 

Prom such sequence data, computational biologists infer the distance between 
any two taxa. There are various algorithms for carrying out this inference. 
They are based on statistical models of evolution. For our discussion, we 
may think of the distance between any two strings as a refined version of the 
Hamming distance (= the proportion of characters where they differ). In our 
(Human, Mouse, Rat, Chicken) example, the inferred distance matrix might 
be the following symmetric 4 x 4-matrix: 
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The problem of phylogenetics is to construct a tree with edge lengths which 
represent this distance matrix, provided such a tree exists. In our example, 
a tree does exist. It is depicted in Figure 3. The number next to the each 
edge is its length. The distance between two leaves in the tree is the sum 
of the lengths of the edges on the unique path between the two leaves. For 
instance, the distance in this tree between "Human" and "Mouse" equals 
0.6 + 0.3 + 0.2 = 1.1, which is the corresponding entry in the 4 x 4-matrix. 

In general, considering n taxa, the distance between taxon i and taxon j is 
a positive real number dij which has been determined by some bio-statistical 
method. So, what we are given is a real symmetric n x n-matrix 
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We may assume that D is a metric, i.e., the triangle inequalities dik < 
dij + djk hold for all i,j, k. This can be expressed by matrix multiplication: 
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Figure 3: A Phylogenetic Tree 

Fact 5. The matrix D represents a metric if and only if D Q D = D. 

We say that D is a tree metric if there exists a tree T with n leaves, labeled 
1,2, ... ,n, and a positive length for each edge of T, such that the distance 
from leaf i to leaf j equals dij for all Tree metrics occur naturally in 
biology because they model an evolutionary process that led to the n taxa. 

Most metrics D are not tree metrics. If we are given a metric D that arises 
from some biological data then it is reasonable to assume that there exists 
a tree metric which is close to D. Biologists use a variety of algorithms 
(e.g. "neighbor joining") to construct such a nearby tree T from the given 
data D. In what follows we state a tropical characterization of tree metrics. 

Let X = (Xij) be a symmetric matrix with zeros on the diagonal whose 
(2) distinct off-diagonal entries arc unknowns. For each quadruple {i,j, k, 1} 
C {1, 2, . . . , n} we consider the following tropical polynomial of degree two: 

Pijki = XijQXki © XikQXji © XaQXjk. (4) 

This polynomial is the tropical Grassmann-Pliicker relation. It defines a 
hypersurface Ti.{pijki) in the space 1^(2). The tropical Grassmannian is the 
intersection of these (^) hyp ersurf aces. This is a polyhedral fan, denoted 

Gr2,n = n HiPijM) C ]R(^). 

l<i<j<k<l<n 
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Fact 6. A metric D on {1,2, ... ,n} is a tree metric if and only if its negative 
X = —D is a point in the tropical Grassmannian Gr2,n- 

The statement is a reformulation of the Four Point Condition in phylo- 
genetics, which states that D is a tree metric if and only if, for all 1 < 2 < 
j < k < I < n, the maximum of the three numbers Dij + Dki, Dik + Dji 
and Dii + Djk is attained at least twice. For X = —D, this means that the 
minimum of the three numbers Xij + Xki, Xik + Xji and Xu + Xjk is attained 
at least twice, or, equivalently, X G H^pijki)- The tropical Grassmannian 
Gr2,n is also known as the space of phylogenetic trees 0. The combinatorial 
structure of this beautiful space is well-studied and well-understood. 

Our research suggestion for this section concerns a certain reembedding 
of the tropical Grassmannian G'r2,„ into a higher- dimensional space. 

Research problem: Let n > 5 and consider a metric D on {1,2, ... ,n}. 
The triple weights of the metric D are defined as follows: 

Dijk := Dij + Dik + Djk {1 <i < j < k <n). 

This formula specifies a linear map ip : M^^) ^(s). Our problem is to 
characterize the image 4'{Gr2,n) of tree space Gr2^n under this linear map. 

This problem arises from phylogenies of sequence alignments in genomics. 
For such taxa, it can be more reliable statistically to estimate the triple 
weights Dijk rather than the pairwise distances Dij. Pachter and Speyer [TH] 
showed that tree metrics are uniquely determined by their triple weights, i.e., 
the map from Gr2,n onto ifj{Gr2^n) is a bijection. Find a natural system of 
tropical polynomials which define ip{Gr2^n) as a tropical subvariety of M^s). 

5 Linear Spaces 

This section is concerned with tropical linear spaces. Generalizing the notion 
of a line from Section 3, we define a tropical hyperplane to be a subset of 
which has the form where ^ is a tropical linear form in n unknowns: 

£{x) = ai Q Xn ® a2 Q X2 ® ■ ■ ■ ® an Q Xn- 

Here ai, . . . a„ are arbitrary real constants. Solving linear equations in trop- 
ical mathematics means computing the intersection of finitely many hyper- 
planes TC{i). It is tempting to define tropical linear spaces simply as intersec- 
tions of tropical hyperplanes. However, this would not be a good definition 
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because such arbitrary intersections are not always pure dimensional, and 
they do not behave the way linear spaces do in classical geometry. 

A better notion of tropical linear space is derived by allowing only those 
intersections of hyperplanes which are "sufficiently complete" . In what fol- 
lows we offer a definition which is a direct generalization of our discussion in 
Section 4. The idea is that phylogenetic trees are lines in tropical projective 
space, whose Pliicker coordinates are the negated pairwise distances dij. 

We consider the (^) -dimensional space M.^d) whose coordinates Xi^...i^ 
are indexed by d-element subsets {ii, . . . , id} of {1,2,..., n}. Let 5* be any 
(d — 2)-element subset of {1, 2, . . . , n} and let i, j, k and / be any four distinct 
indices in {1, . . . , n}\S'. The corresponding three-term Grassmann Pliicker 
relation ps,ijki is the following tropical polynomial of degree two: 

PS,ijkl = ^Sij & ^Skl © ^Sik&^Sjl © ^Sil&^Sjk- (5) 

We define the three-term tropical Grassmannian to be the intersection 

Grd.n = fl n{ps,rjkl) C m(^), 
S,i,j,k,l 

where the intersection is over all S, i, j, k, I as above. Note that in the special 
case d = 2 we have 5* = 0, the polynomial (0) is the four point condition in 
(jlj), and Gr^^n is the space of trees which was discussed in Section 4. 

We now fix an arbitrary point X = in the three-term tropical 

Grassmannian Gra.n- For any {d -\- l)-subset {jo,ji, . . . ,jd} of {1, 2, . . . , n} 
we consider the following tropical linear form in the variables xi, . . . ,Xn'- 

d 

®^^.-!r-U®^r. (6) 
r=0 

The means to omit jV- The tropical linear space associated with the point 
X is the following set: 

Here the intersection is over all ((i + l)-subsets {jo,ji, ■ ■ ■ ,jd} of {1, 2, . . . , n}. 

The tropical linear spaces are precisely the sets Lx where X is any point 
in Gr^^n C M^t*). The "sufficient completeness" referred to in the first para- 
graph of this section means that we need to solve linear equations using 



3on---jd 
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Cramer's rule, in all possible ways, in order for an intersection of hyper- 
planes to actually be a linear space. The definition of linear space given here 
is more inclusive than the one used in ^21 1201 121] ■, where Lx was required to 
come from ordinary algebraic geometry over a field with a suitable valuation. 

For example, a 3-dimensional tropical linear subspace of (a.k.a. two- 
dimensional plane in tropical projective (n — l)-space) is the intersection of 
(") tropical hyperplanes, each of whose defining linear form has four terms: 

We note that even the very special case when each coordinate of X is 
either (the multiplicative unit) or oo (the additive unit) is really interesting. 
Here Lx is a polyhedral fan known as the Bergman fan of a matroid j2l 12^] . 

Tropical linear spaces have many of the properties of ordinary linear 
spaces. First, they are pure polyhedral complexes of the correct dimension: 

Fact 7. Each maximal cell of the tropical linear space Lx is d- dimensional. 

Every tropical linear space Lx determines its vector of tropical Pliicker 
coordinates X uniquely up to tropical multiplication (= classical addition) 
by a common scalar. If L and V are tropical linear spaces of dimensions d 
and d' with d + d' > then L and L' meet. It is not quite true that two 
tropical linear spaces intersect in a tropical linear space but it is almost true. 
If L and L' are tropical linear spaces of dimensions d and d' with d + d' > n 
and f is a generic small vector then L fl (L' + v) is a tropical linear space 
of dimension d + d' — n. Following [201, it makes sense to define the stable 
intersection of L and L' by taking the limit of L fl {L' + v) as f goes to zero, 
and this limit will again be a tropical linear space of dimension d + d' — n. 

Research Problem: It is not true that a rf-dimensional tropical linear space 
can always be written as the intersection of n — d tropical hyperplanes. The 
definition shows that (^"^) hyperplanes are always enough. What is the 
minimum number of tropical hyperplanes needed to cut out any tropical 
linear space of dimension d in n-space? Are n hyperplanes always enough? 
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